Approximate Dynamic Programming: An Efficient Machine Learning Algorithm

نویسندگان

چکیده

We propose an efficient machine learning algorithm for two-stage stochastic programs. This is termed as projected hybrid algorithm, and consists of sub-gradient piecewise linear approximation methods. use the sample information to update on objective function. Then we introduce a projection step, which implemented methods, jump out from local optimum, so that can achieve global optimum. By innovative show convergent property general Furthermore, network recourse problem, our drop steps, but still maintains convergence property. Thus, if properly construct initial functions, pure method The proposed approximate dynamic programming overcomes high dimensional state variables using methods learning, its logic capture critical ability structure anticipate impact decisions now future. optimization framework, carefully calibrated against historical performance, make it possible changes in collective intelligence experienced decisions. Computational results indicate exhibits rapid convergence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient sampling in approximate dynamic programming algorithms

Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimens...

متن کامل

An Approximate Dynamic Programming Algorithm for Monotone Value Functions

Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title. However, use of a template does not certify that the paper has been accepted for publication in the named journal. INFORMS journal templates are for the exclusive purpose of submitting to an INFORMS journal and should not be used to distribute the papers in print ...

متن کامل

Dynamic Programming for Approximate Expansion Algorithm

Expansion algorithm is a popular optimization method for labeling problems. For many common energies, each expansion step can be optimally solved with a min-cut/max flow algorithm. While the observed performance of max-flow for the expansion algorithm is fast, its theoretical time complexity is worse than linear in the number of pixels. Recently, Dynamic Programming (DP) was shown to be useful ...

متن کامل

Optimal Learning and Approximate Dynamic Programming

Approximate dynamic programming (ADP) has emerged as a powerful tool for tackling a diverse collection of stochastic optimization problems. Reflecting the wide diversity of problems, ADP (including research under names such as reinforcement learning, adaptive dynamic programming and neuro-dynamic programming) has become an umbrella for a wide range of algorithmic strategies. Most of these invol...

متن کامل

Approximate Dynamic Programming and Reinforcement Learning

Dynamic programming (DP) and reinforcement learning (RL) can be used to address problems from a variety of fields, including automatic control, artificial intelligence, operations research, and economy. Many problems in these fields are described by continuous variables, whereas DP and RL can find exact solutions only in the discrete case. Therefore, approximation is essential in practical DP a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Artificial intelligence

سال: 2023

ISSN: ['2633-1403']

DOI: https://doi.org/10.5772/intechopen.106691